t-test : Two Independent Samples

Two sample t-statistics: Participants from Separate group 

  • The two sets of data could come from two completely separate groups of participants.
  • The study may involve a sample of boys compared with a sample of men.
  • The study may compare JEE results of one group of participants who are given computers, with grades for a second group who are not given computers

t-test : Two Independent Samples


Prerequisite for better understanding

  • Sample variance
  • Standard error formulas
  • The t statistic
  • Distribution of t values
  • df for the t statistic
  • Estimated standard error

The Independent Samples t-test compares the means of two independent groups in order to determine whether there is statistical evidence that the associated population means are significantly different.


Two samples t-statistic | Independent or between-subjects design

A design that uses a separate group of participants for each treatment condition is called an independent-measures research design or a between-subjects design.

t-test : Two Independent Samples

  • The independent-measures t statistic is used for hypothesis testing.
  • The difference between the two sample means (µsample1 − µsample2) is used as a basis for testing a hypothesis about the difference between the two population means (µ1 − µ2).

Example: Two-sample t-statistic an Independent Measure test  

Population-1:

  • Treatment: Vegetation diet + Daily exercise for six months,
  • Population mean: Population mean before treatment is unknown (μ1 =?)
  • Sample: As we cannot practically analyse the whole population after treatment, therefore random samples are selected for analysis

t-test : Two Independent Samples

Population-2:

  • Treatment: Sea Food + No exercise for six months,
  • Population mean: Population mean before treatment is unknown (μ2 =?)
  • Sample: As we cannot practically analyse the whole population after treatment, therefore random samples are selected for analysis
  • Test: to determine any significant difference in health conditions among these two populations.t-test : Two Independent Samples

Two samples: t-statistic analysis:

  • Used to compare the effect of treatment on population 1 and population 2.
  • Samples are drawn from these two populations and a study is conducted.
  • As per the sample statistics, the inference is taken about the population and whether there is no difference in health conditions of both populations
  • Or there is a significant difference in the health of people of the two populations.

Summary

t-test : Two Independent Samples


Independent-Measure Test:

The objective is to evaluate the mean difference between two populations (or between two treatment conditions) and conclude whether two population/treatment has a significant effect or whether the variation is only because of chance.


Null hypothesis

  • The null hypothesis states that there is no effect, no difference or no change.

H0 : µ1 − µ2 = 0 

(No difference between the population means)

  • The null hypothesis could also be stated as

µ1 = µ2


The alternative hypothesis

  • The alternative hypothesis states that there is a mean difference between the two populations,

H1 : µ1 − µ2 ≠ 0

(Both means are different. Change is because of treatment on population)

  • The alternative hypothesis simply states that two population means are not equal:

µ1 ≠ µ2


Formula: 

Two sample t-statistic: Independent-measure Hypothesis Test

  • Remember the formula of one sample t statistic:-

t-test : Two Independent Samples

  • Refer above formula of ONE sample t statistics and re-write the below formula for TWO sample

t-test : Two Independent Samples

  • Sample statistic difference = difference between the mean of sample1 and sample2

= (µsample1 − µsample2)

  • hypothesized Population parameter difference= µ1 – µ2
  • Pooled estimated standard error = sp
  • t statistics =

t-test : Two Independent Samples

  • Considering H0 : µ1 – µ2 =0 

t-test : Two Independent Samples

t statistics


t-test : Two Independent Samples


Estimated standard error Formula – Independent-Measures Hypothesis Test

  • The standard deviation of sample 1 = s1
  • The Standard estimated of sample 2 = s2

t-test : Two Independent Samples

  • Add the above two errors together to get a total Estimated standard error for two samples Independent-Measures Hypothesis tests. The resulting formula for standard error is

t-test : Two Independent Samples


Two sample t-statistics: Degrees of Freedom 

  • df for the t statistic = df for the first sample + df for the second sample

t-test : Two Independent Samples


Two sample t-statistics: Steps for the hypothesis test

  • The independent-measures t statistic uses the data from two separate samples to help decide whether there is a significant mean difference between two populations or between two treatment conditions.

A complete example of a hypothesis test with two independent samples is as follows


Step 1: State the hypotheses and select the alpha level.

H0 : µ1 − µ2 = 0 (No difference.)

H1: µ1 − µ2 ≠ 0 (There is a difference.)

  • set α = .05. Directional hypotheses could be used and would specify whether the students who were tested in a dimly lit room should have higher or lower scores

Step 2: This is an independent-measures design.

The t statistic for these data has degrees of freedom determined by

df = df1 + df2

df = n1 + n2 – 2

    • n1= sample size of sample-1
    • n2= sample size of sample-2

Step 3: Calculate the test statistic.

Calculate t :

t-test : Two Independent Samples


Step4: Make a decision

  • If the obtained value t is in the critical region. Reject H0 and conclude that there is a significant difference
  • If the obtained value t is not in the critical region. Accept H0 and conclude that there is no difference, both are the same.

t-test : Two Independent Samples


Two sample t-statistic: Assumptions 

  • The observations within each sample must be independent
  • The two populations from which the samples are selected must be normal
  • The two populations from which the samples are selected must have equal variances

Two sample t-statistic: Hartley’s F-Max Test

  • If the two population variances are equal, then the two sample variances should be very similar. When the two sample variances are reasonably close, you can be reasonably confident that the homogeneity assumption has been satisfied and proceed with the test.
  • If one sample variance is more than three or four times larger than the other, then there is a reason for concern. A more objective procedure involves a statistical test to evaluate the homogeneity assumption.
  • Hartley’s F-max test is one of the simplest to compute and understand homogeneity of variance with more than two independent samples.
  • The F-max test is based on the principle that a sample variance provides an unbiased estimate of the population variance. The null hypothesis for this test states that the population variances are equal, therefore, the sample variances should be very similar. The procedure for using the F-max test is as follows:
  • Compute the sample variance for each of the separate samples  t-test : Two Independent Samples
  • Select the largest and the smallest of these sample variances and compute F

t-test : Two Independent Samples

  • A relatively large value for F-max indicates a large difference between the sample variances. In this case, the data suggest that the population variances are different and that the homogeneity assumption has been violated.
  • On the other hand, a small value of F-max (near 1.00) indicates that the sample variances are similar and that the homogeneity assumption is reasonable
  • The F-max value computed for the sample data is compared with the critical value found in Table.

Two sample t-statistic: F-max value Table

t-test : Two Independent Samples


  • If the sample value is larger than the table value, you conclude that the variances are different and that the homogeneity assumption is not valid. To locate the critical value in the table, you need to know
    • k = the number of separate samples. (For the independent-measures t-test, k = 2)
    • df = n − 1 for each sample variance.
    • The Hartley test assumes that all samples are the same size.
  • The alpha level. The table provides critical values for α = .05 and α = .01.

Two sample t-statistic: effect size estimate: Cohen’s d 

Effect measurement is defined by Cohen’s d : 

  • The difference between the two sample means (x1-x2) is used as the best estimate of the mean difference between the two populations,
  • Pooled standard deviation (the square root of the pooled variance) is used to estimate the population standard deviation :

t-test : Two Independent Samples

  • The formula for estimating Cohen’s d

t-test : Two Independent Samples

t-test : Two Independent Samples


Two sample t-statistics: effect size estimate by r2

It is identified as r2.

Cohen proposed criteria for evaluating the size of a treatment effect that is measured by r2.

t-test : Two Independent Samples

Cohen’s standards for interpreting r2 are shown in below Table

t-test : Two Independent Samples


Two sample t-statistics:Confidence Intervals

Measurement : estimating µ1 – µ2

  • A confidence interval is an alternative method for measuring and describing the size of the treatment effect.
  • The sample means difference (µsample1-µsample2) is used to estimate the population mean difference ( µ1 − µ2).

(µsample1-µsample2) = µdiff

t-test : Two Independent Samples


Two sample t statistics: Reporting the Results

There is a prescribed format for reporting the calculated value of the test statistic,

  • Degrees of freedom,
  • α Alpha level for the t-test.
  • t value
  • Confidence Interval
  • p- probability value (Acknowledges the possibility/probability of a Type I error)

Scroll to Top
logo
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.